Overview

Dataset statistics

Number of variables21
Number of observations34857
Missing cells100975
Missing cells (%)13.8%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory21.6 MiB
Average record size in memory650.7 B

Variable types

NUM13
CAT8

Reproduction

Analysis started2020-08-07 20:24:02.285397
Analysis finished2020-08-07 20:27:32.135975
Duration3 minutes and 29.85 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

Dataset has 1 (< 0.1%) duplicate rows Duplicates
Suburb has a high cardinality: 351 distinct values High cardinality
Address has a high cardinality: 34009 distinct values High cardinality
SellerG has a high cardinality: 388 distinct values High cardinality
Date has a high cardinality: 78 distinct values High cardinality
Bedroom2 is highly correlated with RoomsHigh correlation
Rooms is highly correlated with Bedroom2High correlation
Price has 7610 (21.8%) missing values Missing
Bedroom2 has 8217 (23.6%) missing values Missing
Bathroom has 8226 (23.6%) missing values Missing
Car has 8728 (25.0%) missing values Missing
Landsize has 11810 (33.9%) missing values Missing
BuildingArea has 21115 (60.6%) missing values Missing
YearBuilt has 19306 (55.4%) missing values Missing
Lattitude has 7976 (22.9%) missing values Missing
Longtitude has 7976 (22.9%) missing values Missing
Landsize is highly skewed (γ1 = 96.02231136) Skewed
BuildingArea is highly skewed (γ1 = 99.13257937) Skewed
Address is uniformly distributed Uniform
Car has 1631 (4.7%) zeros Zeros
Landsize has 2437 (7.0%) zeros Zeros

Variables

Suburb
Categorical

HIGH CARDINALITY

Distinct count351
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size272.4 KiB
Reservoir
 
844
Bentleigh East
 
583
Richmond
 
552
Glen Iris
 
491
Preston
 
485
Other values (346)
31902
ValueCountFrequency (%) 
Reservoir8442.4%
 
Bentleigh East5831.7%
 
Richmond5521.6%
 
Glen Iris4911.4%
 
Preston4851.4%
 
Kew4671.3%
 
Brighton4561.3%
 
Brunswick4441.3%
 
South Yarra4351.2%
 
Hawthorn4281.2%
 
Northcote4241.2%
 
Camberwell4231.2%
 
Balwyn North4201.2%
 
Essendon4091.2%
 
Coburg4051.2%
 
Glenroy4001.1%
 
Brighton East3931.1%
 
Pascoe Vale3781.1%
 
St Kilda3741.1%
 
Port Melbourne3711.1%
 
Malvern East3691.1%
 
Prahran3361.0%
 
Thornbury3220.9%
 
Bentleigh3190.9%
 
Balwyn3190.9%
 
Other values (326)2401068.9%
 

Length

Max length18
Median length9
Mean length9.819175488
Min length3

Overview of Unicode Properties

Unique unicode characters49
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e307899.0%
 
r288968.4%
 
o288248.4%
 
n249637.3%
 
a235486.9%
 
t207176.1%
 
l191895.6%
 
i158334.6%
 
s154954.5%
 
132503.9%
 
h113193.3%
 
u87752.6%
 
d77752.3%
 
g67642.0%
 
w66491.9%
 
b63081.8%
 
B50691.5%
 
y47471.4%
 
c44581.3%
 
m43731.3%
 
M43351.3%
 
k41801.2%
 
E40891.2%
 
S36791.1%
 
C36051.1%
 
Other values (24)3463810.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter28078382.0%
 
Uppercase Letter4823414.1%
 
Space Separator132503.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
B506910.5%
 
M43359.0%
 
E40898.5%
 
S36797.6%
 
C36057.5%
 
H34797.2%
 
P31076.4%
 
N28055.8%
 
W27145.6%
 
A23094.8%
 
R19854.1%
 
K19104.0%
 
G16583.4%
 
F15513.2%
 
T12592.6%
 
D9782.0%
 
V9492.0%
 
Y7901.6%
 
I7821.6%
 
O5741.2%
 
L5481.1%
 
J550.1%
 
U4< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e3078911.0%
 
r2889610.3%
 
o2882410.3%
 
n249638.9%
 
a235488.4%
 
t207177.4%
 
l191896.8%
 
i158335.6%
 
s154955.5%
 
h113194.0%
 
u87753.1%
 
d77752.8%
 
g67642.4%
 
w66492.4%
 
b63082.2%
 
y47471.7%
 
c44581.6%
 
m43731.6%
 
k41801.5%
 
v34311.2%
 
p22110.8%
 
f9590.3%
 
z3350.1%
 
x2290.1%
 
j16< 0.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
13250100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin32901796.1%
 
Common132503.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e307899.4%
 
r288968.8%
 
o288248.8%
 
n249637.6%
 
a235487.2%
 
t207176.3%
 
l191895.8%
 
i158334.8%
 
s154954.7%
 
h113193.4%
 
u87752.7%
 
d77752.4%
 
g67642.1%
 
w66492.0%
 
b63081.9%
 
B50691.5%
 
y47471.4%
 
c44581.4%
 
m43731.3%
 
M43351.3%
 
k41801.3%
 
E40891.2%
 
S36791.1%
 
C36051.1%
 
H34791.1%
 
Other values (23)311599.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
13250100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII342267100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e307899.0%
 
r288968.4%
 
o288248.4%
 
n249637.3%
 
a235486.9%
 
t207176.1%
 
l191895.6%
 
i158334.6%
 
s154954.5%
 
132503.9%
 
h113193.3%
 
u87752.6%
 
d77752.3%
 
g67642.0%
 
w66491.9%
 
b63081.8%
 
B50691.5%
 
y47471.4%
 
c44581.3%
 
m43731.3%
 
M43351.3%
 
k41801.2%
 
E40891.2%
 
S36791.1%
 
C36051.1%
 
Other values (24)3463810.1%
 

Address
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count34009
Unique (%)97.6%
Missing0
Missing (%)0.0%
Memory size272.4 KiB
5 Charles St
 
6
25 William St
 
4
3 Charles St
 
3
13 George St
 
3
2 George St
 
3
Other values (34004)
34838
ValueCountFrequency (%) 
5 Charles St6< 0.1%
 
25 William St4< 0.1%
 
3 Charles St3< 0.1%
 
13 George St3< 0.1%
 
2 George St3< 0.1%
 
28 Blair St3< 0.1%
 
5 Margaret St3< 0.1%
 
33 McCracken St3< 0.1%
 
39 Moore St3< 0.1%
 
14 James St3< 0.1%
 
176 Darebin Rd3< 0.1%
 
57 Bay Rd3< 0.1%
 
7 Churchill Av3< 0.1%
 
54 Charles St3< 0.1%
 
13 Robinson St3< 0.1%
 
7 Hope St3< 0.1%
 
2 Bruce St3< 0.1%
 
16 Smith St3< 0.1%
 
3 Donald St3< 0.1%
 
14 Arthur St3< 0.1%
 
36 Aberfeldie St3< 0.1%
 
1/1 Clarendon St3< 0.1%
 
1 Bruce St3< 0.1%
 
21 May St3< 0.1%
 
14 Rose St3< 0.1%
 
Other values (33984)3477899.8%
 

Length

Max length27
Median length13
Mean length13.55136701
Min length8

Overview of Unicode Properties

Unique unicode characters64
Unique unicode categories (?)5
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
7019914.9%
 
t297206.3%
 
e247745.2%
 
r228744.8%
 
a220754.7%
 
S197034.2%
 
n189674.0%
 
1180813.8%
 
o170693.6%
 
l166093.5%
 
d146733.1%
 
i130902.8%
 
2125372.7%
 
/101952.2%
 
397512.1%
 
s88611.9%
 
R84451.8%
 
478611.7%
 
569111.5%
 
C64631.4%
 
h64071.4%
 
A60761.3%
 
659341.3%
 
v57221.2%
 
y54661.2%
 
Other values (39)8389717.8%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter23899550.6%
 
Decimal Number8143617.2%
 
Uppercase Letter7153515.1%
 
Space Separator7019914.9%
 
Other Punctuation101952.2%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
11808122.2%
 
21253715.4%
 
3975112.0%
 
478619.7%
 
569118.5%
 
659347.3%
 
754546.7%
 
053286.5%
 
850916.3%
 
944885.5%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
70199100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S1970327.5%
 
R844511.8%
 
C64639.0%
 
A60768.5%
 
B36795.1%
 
M32464.5%
 
D29204.1%
 
P27823.9%
 
G26493.7%
 
W25473.6%
 
H22873.2%
 
L18622.6%
 
T15512.2%
 
E12891.8%
 
K11771.6%
 
F10681.5%
 
N10501.5%
 
O6811.0%
 
V6620.9%
 
J4960.7%
 
I3280.5%
 
Y2390.3%
 
Q1790.3%
 
U1190.2%
 
Z31< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
t2972012.4%
 
e2477410.4%
 
r228749.6%
 
a220759.2%
 
n189677.9%
 
o170697.1%
 
l166096.9%
 
d146736.1%
 
i130905.5%
 
s88613.7%
 
h64072.7%
 
v57222.4%
 
y54662.3%
 
u53532.2%
 
m46031.9%
 
c45671.9%
 
g43771.8%
 
w32441.4%
 
b32011.3%
 
k31071.3%
 
p18410.8%
 
f14930.6%
 
x5050.2%
 
z2520.1%
 
q77< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
/10195100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin31053065.7%
 
Common16183034.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
7019943.4%
 
11808111.2%
 
2125377.7%
 
/101956.3%
 
397516.0%
 
478614.9%
 
569114.3%
 
659343.7%
 
754543.4%
 
053283.3%
 
850913.1%
 
944882.8%
 

Most frequent Latin characters

ValueCountFrequency (%) 
t297209.6%
 
e247748.0%
 
r228747.4%
 
a220757.1%
 
S197036.3%
 
n189676.1%
 
o170695.5%
 
l166095.3%
 
d146734.7%
 
i130904.2%
 
s88612.9%
 
R84452.7%
 
C64632.1%
 
h64072.1%
 
A60762.0%
 
v57221.8%
 
y54661.8%
 
u53531.7%
 
m46031.5%
 
c45671.5%
 
g43771.4%
 
B36791.2%
 
M32461.0%
 
w32441.0%
 
b32011.0%
 
Other values (27)3126610.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII472360100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
7019914.9%
 
t297206.3%
 
e247745.2%
 
r228744.8%
 
a220754.7%
 
S197034.2%
 
n189674.0%
 
1180813.8%
 
o170693.6%
 
l166093.5%
 
d146733.1%
 
i130902.8%
 
2125372.7%
 
/101952.2%
 
397512.1%
 
s88611.9%
 
R84451.8%
 
478611.7%
 
569111.5%
 
C64631.4%
 
h64071.4%
 
A60761.3%
 
659341.3%
 
v57221.2%
 
y54661.2%
 
Other values (39)8389717.8%
 

Rooms
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0310124221820582
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Memory size272.4 KiB

Quantile statistics

Minimum1
5-th percentile2
Q12
median3
Q34
95-th percentile5
Maximum16
Range15
Interquartile range (IQR)2

Descriptive statistics

Standard deviation0.9699329349
Coefficient of variation (CV)0.320002956
Kurtosis2.511708654
Mean3.031012422
Median Absolute Deviation (MAD)1
Skewness0.4990968808
Sum105652
Variance0.9407698982
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
31508443.3%
 
2833223.9%
 
4795622.8%
 
517375.0%
 
114794.2%
 
62040.6%
 
7320.1%
 
8190.1%
 
106< 0.1%
 
94< 0.1%
 
123< 0.1%
 
161< 0.1%
 
ValueCountFrequency (%) 
114794.2%
 
2833223.9%
 
31508443.3%
 
4795622.8%
 
517375.0%
 
62040.6%
 
7320.1%
 
8190.1%
 
94< 0.1%
 
106< 0.1%
 
ValueCountFrequency (%) 
161< 0.1%
 
123< 0.1%
 
106< 0.1%
 
94< 0.1%
 
8190.1%
 
7320.1%
 
62040.6%
 
517375.0%
 
4795622.8%
 
31508443.3%
 

Type
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size272.4 KiB
h
23980
u
7297
t
 
3580
ValueCountFrequency (%) 
h2398068.8%
 
u729720.9%
 
t358010.3%
 

Length

Max length1
Median length1
Mean length1
Min length1

Overview of Unicode Properties

Unique unicode characters3
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
h2398068.8%
 
u729720.9%
 
t358010.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter34857100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
h2398068.8%
 
u729720.9%
 
t358010.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin34857100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
h2398068.8%
 
u729720.9%
 
t358010.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII34857100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
h2398068.8%
 
u729720.9%
 
t358010.3%
 

Price
Real number (ℝ≥0)

MISSING

Distinct count2871
Unique (%)10.5%
Missing7610
Missing (%)21.8%
Infinite0
Infinite (%)0.0%
Mean1050173.344955408
Minimum85000.0
Maximum11200000.0
Zeros0
Zeros (%)0.0%
Memory size272.4 KiB

Quantile statistics

Minimum85000
5-th percentile415000
Q1635000
median870000
Q31295000
95-th percentile2250000
Maximum11200000
Range11115000
Interquartile range (IQR)660000

Descriptive statistics

Standard deviation641467.1301
Coefficient of variation (CV)0.6108202357
Kurtosis13.09720052
Mean1050173.345
Median Absolute Deviation (MAD)290000
Skewness2.588969341
Sum2.861407313e+10
Variance4.11480079e+11
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6000002350.7%
 
11000002350.7%
 
6500002190.6%
 
8000002170.6%
 
13000002100.6%
 
10000002050.6%
 
12000002040.6%
 
7000001970.6%
 
7500001940.6%
 
9000001910.5%
 
8500001760.5%
 
9500001720.5%
 
12500001580.5%
 
15000001530.4%
 
5000001530.4%
 
14000001500.4%
 
11500001490.4%
 
5500001410.4%
 
10500001390.4%
 
7800001360.4%
 
7700001310.4%
 
7200001280.4%
 
13500001250.4%
 
6300001240.4%
 
7300001240.4%
 
Other values (2846)2298165.9%
 
(Missing)761021.8%
 
ValueCountFrequency (%) 
850001< 0.1%
 
1120001< 0.1%
 
1210001< 0.1%
 
1310001< 0.1%
 
1450002< 0.1%
 
1600001< 0.1%
 
1700002< 0.1%
 
1850002< 0.1%
 
1900001< 0.1%
 
2000002< 0.1%
 
ValueCountFrequency (%) 
112000001< 0.1%
 
90000001< 0.1%
 
80000001< 0.1%
 
76500001< 0.1%
 
70000001< 0.1%
 
68000001< 0.1%
 
66000001< 0.1%
 
65000002< 0.1%
 
64600001< 0.1%
 
64000002< 0.1%
 

Method
Categorical

Distinct count9
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size272.4 KiB
S
19744
SP
5095
PI
4850
VB
 
3108
SN
 
1317
Other values (4)
 
743
ValueCountFrequency (%) 
S1974456.6%
 
SP509514.6%
 
PI485013.9%
 
VB31088.9%
 
SN13173.8%
 
PN3080.9%
 
SA2260.6%
 
W1730.5%
 
SS360.1%
 

Length

Max length2
Median length1
Mean length1.428608314
Min length1

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
S2645453.1%
 
P1025320.6%
 
I48509.7%
 
V31086.2%
 
B31086.2%
 
N16253.3%
 
A2260.5%
 
W1730.3%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter49797100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S2645453.1%
 
P1025320.6%
 
I48509.7%
 
V31086.2%
 
B31086.2%
 
N16253.3%
 
A2260.5%
 
W1730.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin49797100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
S2645453.1%
 
P1025320.6%
 
I48509.7%
 
V31086.2%
 
B31086.2%
 
N16253.3%
 
A2260.5%
 
W1730.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII49797100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
S2645453.1%
 
P1025320.6%
 
I48509.7%
 
V31086.2%
 
B31086.2%
 
N16253.3%
 
A2260.5%
 
W1730.3%
 

SellerG
Categorical

HIGH CARDINALITY

Distinct count388
Unique (%)1.1%
Missing0
Missing (%)0.0%
Memory size272.4 KiB
Jellis
 
3359
Nelson
 
3236
Barry
 
3235
hockingstuart
 
2623
Marshall
 
2027
Other values (383)
20377
ValueCountFrequency (%) 
Jellis33599.6%
 
Nelson32369.3%
 
Barry32359.3%
 
hockingstuart26237.5%
 
Marshall20275.8%
 
Ray19505.6%
 
Buxton18685.4%
 
Biggin8972.6%
 
Fletchers8612.5%
 
Woodards7142.0%
 
Brad7012.0%
 
McGrath6021.7%
 
Noel5241.5%
 
Greg5191.5%
 
RT5161.5%
 
Miles4781.4%
 
YPA4731.4%
 
Jas4571.3%
 
Harcourts4471.3%
 
Stockdale4201.2%
 
Hodges4131.2%
 
Gary4131.2%
 
Sweeney4111.2%
 
Kay3601.0%
 
Raine3260.9%
 
Other values (363)702720.2%
 

Length

Max length27
Median length6
Mean length6.291533982
Min length1

Overview of Unicode Properties

Unique unicode characters58
Unique unicode categories (?)4
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
l194528.9%
 
a190368.7%
 
r183568.4%
 
s165927.6%
 
e156937.2%
 
o135506.2%
 
n123845.6%
 
i119725.5%
 
t101324.6%
 
B74333.4%
 
h73483.4%
 
y67523.1%
 
g62032.8%
 
u57492.6%
 
c57382.6%
 
J40471.8%
 
N40091.8%
 
R38061.7%
 
k37391.7%
 
M37151.7%
 
d36981.7%
 
x19900.9%
 
G16310.7%
 
W14600.7%
 
H13680.6%
 
Other values (33)134516.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter18158082.8%
 
Uppercase Letter3686416.8%
 
Other Punctuation5420.2%
 
Decimal Number3180.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
B743320.2%
 
J404711.0%
 
N400910.9%
 
R380610.3%
 
M371510.1%
 
G16314.4%
 
W14604.0%
 
H13683.7%
 
P11493.1%
 
F10682.9%
 
A10562.9%
 
S9992.7%
 
C9422.6%
 
T9132.5%
 
L7702.1%
 
D5351.5%
 
Y4751.3%
 
K4111.1%
 
O3160.9%
 
E3070.8%
 
V2340.6%
 
I1830.5%
 
X18< 0.1%
 
U17< 0.1%
 
Z1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
l1945210.7%
 
a1903610.5%
 
r1835610.1%
 
s165929.1%
 
e156938.6%
 
o135507.5%
 
n123846.8%
 
i119726.6%
 
t101325.6%
 
h73484.0%
 
y67523.7%
 
g62033.4%
 
u57493.2%
 
c57383.2%
 
k37392.1%
 
d36982.0%
 
x19901.1%
 
m9120.5%
 
w7290.4%
 
p4760.3%
 
b4600.3%
 
v4310.2%
 
z1020.1%
 
f71< 0.1%
 
q15< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
'31858.7%
 
.10118.6%
 
&7213.3%
 
/397.2%
 
@122.2%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
215950.0%
 
115950.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin21844499.6%
 
Common8600.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
l194528.9%
 
a190368.7%
 
r183568.4%
 
s165927.6%
 
e156937.2%
 
o135506.2%
 
n123845.7%
 
i119725.5%
 
t101324.6%
 
B74333.4%
 
h73483.4%
 
y67523.1%
 
g62032.8%
 
u57492.6%
 
c57382.6%
 
J40471.9%
 
N40091.8%
 
R38061.7%
 
k37391.7%
 
M37151.7%
 
d36981.7%
 
x19900.9%
 
G16310.7%
 
W14600.7%
 
H13680.6%
 
Other values (26)125915.8%
 

Most frequent Common characters

ValueCountFrequency (%) 
'31837.0%
 
215918.5%
 
115918.5%
 
.10111.7%
 
&728.4%
 
/394.5%
 
@121.4%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII219304100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
l194528.9%
 
a190368.7%
 
r183568.4%
 
s165927.6%
 
e156937.2%
 
o135506.2%
 
n123845.6%
 
i119725.5%
 
t101324.6%
 
B74333.4%
 
h73483.4%
 
y67523.1%
 
g62032.8%
 
u57492.6%
 
c57382.6%
 
J40471.8%
 
N40091.8%
 
R38061.7%
 
k37391.7%
 
M37151.7%
 
d36981.7%
 
x19900.9%
 
G16310.7%
 
W14600.7%
 
H13680.6%
 
Other values (33)134516.1%
 

Date
Categorical

HIGH CARDINALITY

Distinct count78
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size272.4 KiB
28/10/2017
 
1119
17/03/2018
 
970
24/02/2018
 
941
9/12/2017
 
927
25/11/2017
 
902
Other values (73)
29998
ValueCountFrequency (%) 
28/10/201711193.2%
 
17/03/20189702.8%
 
24/02/20189412.7%
 
9/12/20179272.7%
 
25/11/20179022.6%
 
18/11/20178662.5%
 
3/03/20188462.4%
 
6/01/20187872.3%
 
27/05/20177702.2%
 
23/09/20177422.1%
 
16/09/20177302.1%
 
11/11/20177292.1%
 
3/06/20176892.0%
 
14/10/20176781.9%
 
21/10/20176581.9%
 
26/08/20176471.9%
 
17/06/20176371.8%
 
24/06/20176071.7%
 
9/09/20175981.7%
 
7/10/20175931.7%
 
27/11/20165751.6%
 
3/09/20175671.6%
 
25/02/20175551.6%
 
12/08/20175461.6%
 
4/03/20175321.5%
 
Other values (53)1664647.8%
 

Length

Max length10
Median length10
Mean length9.714748831
Min length9

Overview of Unicode Properties

Unique unicode characters11
Unique unicode categories (?)2
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
/6971420.6%
 
06558819.4%
 
16487919.2%
 
25391415.9%
 
7283398.4%
 
6168765.0%
 
8126533.7%
 
379762.4%
 
974362.2%
 
557601.7%
 
454921.6%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number26891379.4%
 
Other Punctuation6971420.6%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
06558824.4%
 
16487924.1%
 
25391420.0%
 
72833910.5%
 
6168766.3%
 
8126534.7%
 
379763.0%
 
974362.8%
 
557602.1%
 
454922.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
/69714100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common338627100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
/6971420.6%
 
06558819.4%
 
16487919.2%
 
25391415.9%
 
7283398.4%
 
6168765.0%
 
8126533.7%
 
379762.4%
 
974362.2%
 
557601.7%
 
454921.6%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII338627100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
/6971420.6%
 
06558819.4%
 
16487919.2%
 
25391415.9%
 
7283398.4%
 
6168765.0%
 
8126533.7%
 
379762.4%
 
974362.2%
 
557601.7%
 
454921.6%
 

Distance
Real number (ℝ≥0)

Distinct count215
Unique (%)0.6%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean11.184929423915538
Minimum0.0
Maximum48.1
Zeros77
Zeros (%)0.2%
Memory size272.4 KiB

Quantile statistics

Minimum0
5-th percentile2.7
Q16.4
median10.3
Q314
95-th percentile24.7
Maximum48.1
Range48.1
Interquartile range (IQR)7.6

Descriptive statistics

Standard deviation6.788892456
Coefficient of variation (CV)0.6069678403
Kurtosis3.585924276
Mean11.18492942
Median Absolute Deviation (MAD)3.85
Skewness1.503585816
Sum389861.9
Variance46.08906078
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11.214204.1%
 
13.86812.0%
 
9.26651.9%
 
7.86621.9%
 
10.56601.9%
 
8.46041.7%
 
4.65851.7%
 
14.75661.6%
 
5.25651.6%
 
11.45211.5%
 
13.94991.4%
 
9.74701.3%
 
7.54681.3%
 
6.44591.3%
 
5.34371.3%
 
134271.2%
 
144251.2%
 
6.24031.2%
 
124001.1%
 
16.73941.1%
 
8.83891.1%
 
7.73841.1%
 
6.33661.1%
 
20.63561.0%
 
5.93531.0%
 
Other values (190)2169762.2%
 
ValueCountFrequency (%) 
0770.2%
 
0.7290.1%
 
1.2470.1%
 
1.3300.1%
 
1.46< 0.1%
 
1.5290.1%
 
1.61940.6%
 
1.81520.4%
 
1.91480.4%
 
2520.1%
 
ValueCountFrequency (%) 
48.16< 0.1%
 
47.47< 0.1%
 
47.3200.1%
 
45.9330.1%
 
45.22< 0.1%
 
44.2200.1%
 
43.45< 0.1%
 
43.36< 0.1%
 
41180.1%
 
39.82< 0.1%
 

Postcode
Real number (ℝ≥0)

Distinct count211
Unique (%)0.6%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean3116.062858618315
Minimum3000.0
Maximum3978.0
Zeros0
Zeros (%)0.0%
Memory size272.4 KiB

Quantile statistics

Minimum3000
5-th percentile3015
Q13051
median3103
Q33156
95-th percentile3204
Maximum3978
Range978
Interquartile range (IQR)105

Descriptive statistics

Standard deviation109.0239027
Coefficient of variation (CV)0.03498770971
Kurtosis22.78373808
Mean3116.062859
Median Absolute Deviation (MAD)52
Skewness4.018785705
Sum108613487
Variance11886.21137
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
30738442.4%
 
30466381.8%
 
30206171.8%
 
31216121.8%
 
31655831.7%
 
30585561.6%
 
30405351.5%
 
32045181.5%
 
31635081.5%
 
30124971.4%
 
30324971.4%
 
31464911.4%
 
30724851.4%
 
31814681.3%
 
31014671.3%
 
31864561.3%
 
30564441.3%
 
30844421.3%
 
31414351.2%
 
31224281.2%
 
30704241.2%
 
31244231.2%
 
31044201.2%
 
31274051.2%
 
31883951.1%
 
Other values (186)2226863.9%
 
ValueCountFrequency (%) 
30002040.6%
 
3002590.2%
 
3003660.2%
 
3006760.2%
 
300816< 0.1%
 
30113751.1%
 
30124971.4%
 
30133040.9%
 
30153451.0%
 
30162400.7%
 
ValueCountFrequency (%) 
39785< 0.1%
 
3977330.1%
 
39767< 0.1%
 
39752< 0.1%
 
3910180.1%
 
3810200.1%
 
38096< 0.1%
 
38082< 0.1%
 
38077< 0.1%
 
3806470.1%
 

Bedroom2
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count15
Unique (%)0.1%
Missing8217
Missing (%)23.6%
Infinite0
Infinite (%)0.0%
Mean3.0846471471471473
Minimum0.0
Maximum30.0
Zeros17
Zeros (%)< 0.1%
Memory size272.4 KiB

Quantile statistics

Minimum0
5-th percentile2
Q12
median3
Q34
95-th percentile5
Maximum30
Range30
Interquartile range (IQR)2

Descriptive statistics

Standard deviation0.9806897285
Coefficient of variation (CV)0.3179260647
Kurtosis26.80745531
Mean3.084647147
Median Absolute Deviation (MAD)1
Skewness1.406365679
Sum82175
Variance0.9617523437
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
31188134.1%
 
4634818.2%
 
2577716.6%
 
514274.1%
 
19662.8%
 
61680.5%
 
7300.1%
 
017< 0.1%
 
813< 0.1%
 
95< 0.1%
 
104< 0.1%
 
301< 0.1%
 
121< 0.1%
 
201< 0.1%
 
161< 0.1%
 
(Missing)821723.6%
 
ValueCountFrequency (%) 
017< 0.1%
 
19662.8%
 
2577716.6%
 
31188134.1%
 
4634818.2%
 
514274.1%
 
61680.5%
 
7300.1%
 
813< 0.1%
 
95< 0.1%
 
ValueCountFrequency (%) 
301< 0.1%
 
201< 0.1%
 
161< 0.1%
 
121< 0.1%
 
104< 0.1%
 
95< 0.1%
 
813< 0.1%
 
7300.1%
 
61680.5%
 
514274.1%
 

Bathroom
Real number (ℝ≥0)

MISSING

Distinct count11
Unique (%)< 0.1%
Missing8226
Missing (%)23.6%
Infinite0
Infinite (%)0.0%
Mean1.624798167549097
Minimum0.0
Maximum12.0
Zeros46
Zeros (%)0.1%
Memory size272.4 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile3
Maximum12
Range12
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.7242120115
Coefficient of variation (CV)0.4457242911
Kurtosis4.861008943
Mean1.624798168
Median Absolute Deviation (MAD)1
Skewness1.356293032
Sum43270
Variance0.5244830376
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11296937.2%
 
21106431.7%
 
321816.3%
 
42690.8%
 
5770.2%
 
0460.1%
 
616< 0.1%
 
74< 0.1%
 
83< 0.1%
 
121< 0.1%
 
91< 0.1%
 
(Missing)822623.6%
 
ValueCountFrequency (%) 
0460.1%
 
11296937.2%
 
21106431.7%
 
321816.3%
 
42690.8%
 
5770.2%
 
616< 0.1%
 
74< 0.1%
 
83< 0.1%
 
91< 0.1%
 
ValueCountFrequency (%) 
121< 0.1%
 
91< 0.1%
 
83< 0.1%
 
74< 0.1%
 
616< 0.1%
 
5770.2%
 
42690.8%
 
321816.3%
 
21106431.7%
 
11296937.2%
 

Car
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count15
Unique (%)0.1%
Missing8728
Missing (%)25.0%
Infinite0
Infinite (%)0.0%
Mean1.7288453442535114
Minimum0.0
Maximum26.0
Zeros1631
Zeros (%)4.7%
Memory size272.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q32
95-th percentile4
Maximum26
Range26
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.010770785
Coefficient of variation (CV)0.5846507837
Kurtosis20.85932625
Mean1.728845344
Median Absolute Deviation (MAD)1
Skewness2.09517618
Sum45173
Variance1.021657581
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
21221435.0%
 
1916426.3%
 
016314.7%
 
316064.6%
 
411613.3%
 
51510.4%
 
61400.4%
 
7250.1%
 
8230.1%
 
106< 0.1%
 
93< 0.1%
 
112< 0.1%
 
261< 0.1%
 
121< 0.1%
 
181< 0.1%
 
(Missing)872825.0%
 
ValueCountFrequency (%) 
016314.7%
 
1916426.3%
 
21221435.0%
 
316064.6%
 
411613.3%
 
51510.4%
 
61400.4%
 
7250.1%
 
8230.1%
 
93< 0.1%
 
ValueCountFrequency (%) 
261< 0.1%
 
181< 0.1%
 
121< 0.1%
 
112< 0.1%
 
106< 0.1%
 
93< 0.1%
 
8230.1%
 
7250.1%
 
61400.4%
 
51510.4%
 

Landsize
Real number (ℝ≥0)

MISSING
SKEWED
ZEROS

Distinct count1684
Unique (%)7.3%
Missing11810
Missing (%)33.9%
Infinite0
Infinite (%)0.0%
Mean593.598993361392
Minimum0.0
Maximum433014.0
Zeros2437
Zeros (%)7.0%
Memory size272.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1224
median521
Q3670
95-th percentile1001
Maximum433014
Range433014
Interquartile range (IQR)446

Descriptive statistics

Standard deviation3398.841946
Coefficient of variation (CV)5.725821614
Kurtosis11580.16251
Mean593.5989934
Median Absolute Deviation (MAD)210
Skewness96.02231136
Sum13680676
Variance11552126.58
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
024377.0%
 
6502040.6%
 
6971230.4%
 
585970.3%
 
700860.2%
 
604840.2%
 
534810.2%
 
696800.2%
 
652680.2%
 
600680.2%
 
400660.2%
 
590640.2%
 
581610.2%
 
557600.2%
 
530600.2%
 
653600.2%
 
660580.2%
 
602580.2%
 
603580.2%
 
654580.2%
 
651570.2%
 
613560.2%
 
531550.2%
 
649540.2%
 
725540.2%
 
Other values (1659)1884054.0%
 
(Missing)1181033.9%
 
ValueCountFrequency (%) 
024377.0%
 
13< 0.1%
 
21< 0.1%
 
32< 0.1%
 
51< 0.1%
 
101< 0.1%
 
141< 0.1%
 
152< 0.1%
 
171< 0.1%
 
181< 0.1%
 
ValueCountFrequency (%) 
4330141< 0.1%
 
1466991< 0.1%
 
890301< 0.1%
 
800001< 0.1%
 
760001< 0.1%
 
751001< 0.1%
 
445001< 0.1%
 
428001< 0.1%
 
414001< 0.1%
 
405001< 0.1%
 

BuildingArea
Real number (ℝ≥0)

MISSING
SKEWED

Distinct count740
Unique (%)5.4%
Missing21115
Missing (%)60.6%
Infinite0
Infinite (%)0.0%
Mean160.25640035657113
Minimum0.0
Maximum44515.0
Zeros76
Zeros (%)0.2%
Memory size272.4 KiB

Quantile statistics

Minimum0
5-th percentile56
Q1102
median136
Q3188
95-th percentile310
Maximum44515
Range44515
Interquartile range (IQR)86

Descriptive statistics

Standard deviation401.2670601
Coefficient of variation (CV)2.50390661
Kurtosis10877.52575
Mean160.2564004
Median Absolute Deviation (MAD)41
Skewness99.13257937
Sum2202243.454
Variance161015.2535
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1201850.5%
 
1001610.5%
 
1101590.5%
 
1301530.4%
 
1151490.4%
 
1401420.4%
 
1501360.4%
 
1601230.4%
 
1121230.4%
 
1251190.3%
 
1051100.3%
 
1801100.3%
 
1451020.3%
 
1081010.3%
 
1041010.3%
 
901000.3%
 
1021000.3%
 
801000.3%
 
132990.3%
 
135990.3%
 
133980.3%
 
170960.3%
 
95950.3%
 
118940.3%
 
123940.3%
 
Other values (715)1079331.0%
 
(Missing)2111560.6%
 
ValueCountFrequency (%) 
0760.2%
 
0.011< 0.1%
 
115< 0.1%
 
2200.1%
 
3250.1%
 
46< 0.1%
 
54< 0.1%
 
71< 0.1%
 
91< 0.1%
 
101< 0.1%
 
ValueCountFrequency (%) 
445151< 0.1%
 
67911< 0.1%
 
61781< 0.1%
 
46451< 0.1%
 
36471< 0.1%
 
35581< 0.1%
 
31121< 0.1%
 
20021< 0.1%
 
15611< 0.1%
 
11431< 0.1%
 

YearBuilt
Real number (ℝ≥0)

MISSING

Distinct count160
Unique (%)1.0%
Missing19306
Missing (%)55.4%
Infinite0
Infinite (%)0.0%
Mean1965.289884894862
Minimum1196.0
Maximum2106.0
Zeros0
Zeros (%)0.0%
Memory size272.4 KiB

Quantile statistics

Minimum1196
5-th percentile1900
Q11940
median1970
Q32000
95-th percentile2013
Maximum2106
Range910
Interquartile range (IQR)60

Descriptive statistics

Standard deviation37.32817802
Coefficient of variation (CV)0.01899372622
Kurtosis10.89861685
Mean1965.289885
Median Absolute Deviation (MAD)30
Skewness-1.080913147
Sum30562223
Variance1393.392875
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
197014904.3%
 
196012603.6%
 
195010893.1%
 
19807262.1%
 
19006061.7%
 
20005711.6%
 
19205451.6%
 
19305311.5%
 
19104601.3%
 
18904441.3%
 
19404061.2%
 
19753871.1%
 
20103651.0%
 
19903611.0%
 
20123331.0%
 
20052760.8%
 
19652600.7%
 
20132470.7%
 
20112410.7%
 
20092290.7%
 
19852290.7%
 
19952150.6%
 
20142120.6%
 
20082020.6%
 
20072000.6%
 
Other values (135)366610.5%
 
(Missing)1930655.4%
 
ValueCountFrequency (%) 
11961< 0.1%
 
18001< 0.1%
 
18201< 0.1%
 
18301< 0.1%
 
18504< 0.1%
 
18542< 0.1%
 
18551< 0.1%
 
18561< 0.1%
 
18572< 0.1%
 
186011< 0.1%
 
ValueCountFrequency (%) 
21061< 0.1%
 
20191< 0.1%
 
20184< 0.1%
 
2017820.2%
 
20161300.4%
 
20151560.4%
 
20142120.6%
 
20132470.7%
 
20123331.0%
 
20112410.7%
 

CouncilArea
Categorical

Distinct count33
Unique (%)0.1%
Missing3
Missing (%)< 0.1%
Memory size272.4 KiB
Boroondara City Council
 
3675
Darebin City Council
 
2851
Moreland City Council
 
2122
Glen Eira City Council
 
2006
Melbourne City Council
 
1952
Other values (28)
22248
ValueCountFrequency (%) 
Boroondara City Council367510.5%
 
Darebin City Council28518.2%
 
Moreland City Council21226.1%
 
Glen Eira City Council20065.8%
 
Melbourne City Council19525.6%
 
Banyule City Council18615.3%
 
Moonee Valley City Council17915.1%
 
Bayside City Council17645.1%
 
Brimbank City Council15934.6%
 
Monash City Council14664.2%
 
Stonnington City Council14604.2%
 
Maribyrnong City Council14514.2%
 
Port Phillip City Council12803.7%
 
Hume City Council12143.5%
 
Yarra City Council11863.4%
 
Manningham City Council10463.0%
 
Hobsons Bay City Council9422.7%
 
Kingston City Council8712.5%
 
Whittlesea City Council8282.4%
 
Wyndham City Council6241.8%
 
Whitehorse City Council6181.8%
 
Maroondah City Council5061.5%
 
Knox City Council3711.1%
 
Greater Dandenong City Council3140.9%
 
Melton City Council2920.8%
 
Other values (8)7702.2%
 

Length

Max length30
Median length22
Mean length21.73256448
Min length3

Overview of Unicode Properties

Unique unicode characters37
Unique unicode categories (?)3
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
i8703411.5%
 
7618910.1%
 
n722919.5%
 
C696219.2%
 
o663788.8%
 
l502806.6%
 
y431595.7%
 
t428115.7%
 
u399695.3%
 
c349204.6%
 
a337094.4%
 
r270263.6%
 
e259153.4%
 
M106991.4%
 
B98351.3%
 
d90921.2%
 
b88841.2%
 
s80451.1%
 
h73101.0%
 
g52900.7%
 
m45650.6%
 
D31650.4%
 
P25600.3%
 
G23200.3%
 
H21560.3%
 
Other values (12)143091.9%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter57030075.3%
 
Uppercase Letter11104314.7%
 
Space Separator7618910.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C6962162.7%
 
M106999.6%
 
B98358.9%
 
D31652.9%
 
P25602.3%
 
G23202.1%
 
H21561.9%
 
W20701.9%
 
E20061.8%
 
V17911.6%
 
S17641.6%
 
Y12881.2%
 
K12421.1%
 
F2900.3%
 
R1480.1%
 
N880.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
i8703415.3%
 
n7229112.7%
 
o6637811.6%
 
l502808.8%
 
y431597.6%
 
t428117.5%
 
u399697.0%
 
c349206.1%
 
a337095.9%
 
r270264.7%
 
e259154.5%
 
d90921.6%
 
b88841.6%
 
s80451.4%
 
h73101.3%
 
g52900.9%
 
m45650.8%
 
k19710.3%
 
p12800.2%
 
x3710.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
76189100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin68134389.9%
 
Common7618910.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
i8703412.8%
 
n7229110.6%
 
C6962110.2%
 
o663789.7%
 
l502807.4%
 
y431596.3%
 
t428116.3%
 
u399695.9%
 
c349205.1%
 
a337094.9%
 
r270264.0%
 
e259153.8%
 
M106991.6%
 
B98351.4%
 
d90921.3%
 
b88841.3%
 
s80451.2%
 
h73101.1%
 
g52900.8%
 
m45650.7%
 
D31650.5%
 
P25600.4%
 
G23200.3%
 
H21560.3%
 
W20700.3%
 
Other values (11)122391.8%
 

Most frequent Common characters

ValueCountFrequency (%) 
76189100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII757532100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
i8703411.5%
 
7618910.1%
 
n722919.5%
 
C696219.2%
 
o663788.8%
 
l502806.6%
 
y431595.7%
 
t428115.7%
 
u399695.3%
 
c349204.6%
 
a337094.4%
 
r270263.6%
 
e259153.4%
 
M106991.4%
 
B98351.3%
 
d90921.2%
 
b88841.2%
 
s80451.1%
 
h73101.0%
 
g52900.7%
 
m45650.6%
 
D31650.4%
 
P25600.3%
 
G23200.3%
 
H21560.3%
 
Other values (12)143091.9%
 

Lattitude
Real number (ℝ)

MISSING

Distinct count13402
Unique (%)49.9%
Missing7976
Missing (%)22.9%
Infinite0
Infinite (%)0.0%
Mean-37.810634295599115
Minimum-38.19043
Maximum-37.3902
Zeros0
Zeros (%)0.0%
Memory size272.4 KiB

Quantile statistics

Minimum-38.19043
5-th percentile-37.9485
Q1-37.86295
median-37.8076
Q3-37.7541
95-th percentile-37.67519
Maximum-37.3902
Range0.80023
Interquartile range (IQR)0.10885

Descriptive statistics

Standard deviation0.09027890451
Coefficient of variation (CV)-0.002387659086
Kurtosis1.544527049
Mean-37.8106343
Median Absolute Deviation (MAD)0.05448
Skewness-0.2576614223
Sum-1016387.66
Variance0.008150280599
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-37.8361250.1%
 
-37.8424220.1%
 
-37.8198200.1%
 
-37.7956200.1%
 
-37.8414180.1%
 
-37.7969180.1%
 
-37.853617< 0.1%
 
-37.794117< 0.1%
 
-37.763417< 0.1%
 
-37.812716< 0.1%
 
-37.85816< 0.1%
 
-37.84716< 0.1%
 
-37.85116< 0.1%
 
-37.857316< 0.1%
 
-37.816116< 0.1%
 
-37.781816< 0.1%
 
-37.816715< 0.1%
 
-37.782915< 0.1%
 
-37.796415< 0.1%
 
-37.783915< 0.1%
 
-37.777615< 0.1%
 
-37.857115< 0.1%
 
-37.828815< 0.1%
 
-37.859715< 0.1%
 
-37.776515< 0.1%
 
Other values (13377)2646075.9%
 
(Missing)797622.9%
 
ValueCountFrequency (%) 
-38.190431< 0.1%
 
-38.18561< 0.1%
 
-38.184631< 0.1%
 
-38.184181< 0.1%
 
-38.184151< 0.1%
 
-38.182551< 0.1%
 
-38.181631< 0.1%
 
-38.179281< 0.1%
 
-38.178291< 0.1%
 
-38.177451< 0.1%
 
ValueCountFrequency (%) 
-37.39021< 0.1%
 
-37.39511< 0.1%
 
-37.39781< 0.1%
 
-37.399461< 0.1%
 
-37.403491< 0.1%
 
-37.40721< 0.1%
 
-37.407441< 0.1%
 
-37.407581< 0.1%
 
-37.408531< 0.1%
 
-37.408691< 0.1%
 

Longtitude
Real number (ℝ≥0)

MISSING

Distinct count14524
Unique (%)54.0%
Missing7976
Missing (%)22.9%
Infinite0
Infinite (%)0.0%
Mean145.00185113165432
Minimum144.42379
Maximum145.52635
Zeros0
Zeros (%)0.0%
Memory size272.4 KiB

Quantile statistics

Minimum144.42379
5-th percentile144.80008
Q1144.9335
median145.0078
Q3145.0719
95-th percentile145.1877
Maximum145.52635
Range1.10256
Interquartile range (IQR)0.1384

Descriptive statistics

Standard deviation0.1201687692
Coefficient of variation (CV)0.0008287395521
Kurtosis1.545947474
Mean145.0018511
Median Absolute Deviation (MAD)0.06832
Skewness-0.3948800169
Sum3897794.76
Variance0.01444053308
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
144.9966210.1%
 
144.99117< 0.1%
 
144.98517< 0.1%
 
145.010417< 0.1%
 
144.967916< 0.1%
 
144.991116< 0.1%
 
145.000116< 0.1%
 
145.024316< 0.1%
 
144.99715< 0.1%
 
144.999915< 0.1%
 
145.001715< 0.1%
 
145.010315< 0.1%
 
145.045115< 0.1%
 
144.997415< 0.1%
 
145.011615< 0.1%
 
144.999414< 0.1%
 
145.054514< 0.1%
 
145.011814< 0.1%
 
145.006414< 0.1%
 
145.032214< 0.1%
 
145.016114< 0.1%
 
145.001214< 0.1%
 
145.00814< 0.1%
 
144.992214< 0.1%
 
145.017814< 0.1%
 
Other values (14499)2650076.0%
 
(Missing)797622.9%
 
ValueCountFrequency (%) 
144.423791< 0.1%
 
144.431621< 0.1%
 
144.431811< 0.1%
 
144.43941< 0.1%
 
144.440511< 0.1%
 
144.485711< 0.1%
 
144.491< 0.1%
 
144.49261< 0.1%
 
144.5131< 0.1%
 
144.52061< 0.1%
 
ValueCountFrequency (%) 
145.526351< 0.1%
 
145.52371< 0.1%
 
145.511371< 0.1%
 
145.489851< 0.1%
 
145.482731< 0.1%
 
145.482461< 0.1%
 
145.47791< 0.1%
 
145.472821< 0.1%
 
145.472621< 0.1%
 
145.470521< 0.1%
 

Regionname
Categorical

Distinct count8
Unique (%)< 0.1%
Missing3
Missing (%)< 0.1%
Memory size272.4 KiB
Southern Metropolitan
11836
Northern Metropolitan
9557
Western Metropolitan
6799
Eastern Metropolitan
4377
South-Eastern Metropolitan
 
1739
Other values (3)
 
546
ValueCountFrequency (%) 
Southern Metropolitan1183634.0%
 
Northern Metropolitan955727.4%
 
Western Metropolitan679919.5%
 
Eastern Metropolitan437712.6%
 
South-Eastern Metropolitan17395.0%
 
Eastern Victoria2280.7%
 
Northern Victoria2030.6%
 
Western Victoria1150.3%
 
(Missing)3< 0.1%
 

Length

Max length26
Median length21
Mean length20.85477809
Min length3

Overview of Unicode Properties

Unique unicode characters21
Unique unicode categories (?)4
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
t10575514.5%
 
o9249712.7%
 
r7946810.9%
 
e7607610.5%
 
n691689.5%
 
a412015.7%
 
i354004.9%
 
348544.8%
 
M343084.7%
 
p343084.7%
 
l343084.7%
 
h233353.2%
 
S135751.9%
 
u135751.9%
 
s132581.8%
 
N97601.3%
 
W69141.0%
 
E63440.9%
 
-17390.2%
 
V5460.1%
 
c5460.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter61889585.1%
 
Uppercase Letter714479.8%
 
Space Separator348544.8%
 
Dash Punctuation17390.2%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M3430848.0%
 
S1357519.0%
 
N976013.7%
 
W69149.7%
 
E63448.9%
 
V5460.8%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
t10575517.1%
 
o9249714.9%
 
r7946812.8%
 
e7607612.3%
 
n6916811.2%
 
a412016.7%
 
i354005.7%
 
p343085.5%
 
l343085.5%
 
h233353.8%
 
u135752.2%
 
s132582.1%
 
c5460.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
34854100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-1739100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin69034295.0%
 
Common365935.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
t10575515.3%
 
o9249713.4%
 
r7946811.5%
 
e7607611.0%
 
n6916810.0%
 
a412016.0%
 
i354005.1%
 
M343085.0%
 
p343085.0%
 
l343085.0%
 
h233353.4%
 
S135752.0%
 
u135752.0%
 
s132581.9%
 
N97601.4%
 
W69141.0%
 
E63440.9%
 
V5460.1%
 
c5460.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
3485495.2%
 
-17394.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII726935100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
t10575514.5%
 
o9249712.7%
 
r7946810.9%
 
e7607610.5%
 
n691689.5%
 
a412015.7%
 
i354004.9%
 
348544.8%
 
M343084.7%
 
p343084.7%
 
l343084.7%
 
h233353.2%
 
S135751.9%
 
u135751.9%
 
s132581.8%
 
N97601.3%
 
W69141.0%
 
E63440.9%
 
-17390.2%
 
V5460.1%
 
c5460.1%
 

Propertycount
Real number (ℝ≥0)

Distinct count342
Unique (%)1.0%
Missing3
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean7572.8883055029555
Minimum83.0
Maximum21650.0
Zeros0
Zeros (%)0.0%
Memory size272.4 KiB

Quantile statistics

Minimum83
5-th percentile2185
Q14385
median6763
Q310412
95-th percentile15510
Maximum21650
Range21567
Interquartile range (IQR)6027

Descriptive statistics

Standard deviation4428.090313
Coefficient of variation (CV)0.5847293839
Kurtosis0.8906876388
Mean7572.888306
Median Absolute Deviation (MAD)2823
Skewness0.9921002749
Sum263945449
Variance19607983.82
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
216508442.4%
 
88707222.1%
 
109695831.7%
 
149495521.6%
 
104124911.4%
 
145774851.4%
 
103314671.3%
 
105794561.3%
 
119184441.3%
 
148874351.2%
 
113084281.2%
 
113644241.2%
 
89204221.2%
 
78094201.2%
 
92644091.2%
 
112044051.2%
 
69383931.1%
 
74853781.1%
 
132403741.1%
 
86483711.1%
 
88013691.1%
 
77173361.0%
 
67953190.9%
 
56823190.9%
 
65433040.9%
 
Other values (317)2370468.0%
 
ValueCountFrequency (%) 
831< 0.1%
 
1211< 0.1%
 
1291< 0.1%
 
2421< 0.1%
 
2495< 0.1%
 
2711< 0.1%
 
2902< 0.1%
 
3351< 0.1%
 
3421< 0.1%
 
38913< 0.1%
 
ValueCountFrequency (%) 
216508442.4%
 
174962040.6%
 
17384200.1%
 
17093470.1%
 
170551230.4%
 
161661780.5%
 
155421290.4%
 
155102550.7%
 
153212350.7%
 
149495521.6%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

SuburbAddressRoomsTypePriceMethodSellerGDateDistancePostcodeBedroom2BathroomCarLandsizeBuildingAreaYearBuiltCouncilAreaLattitudeLongtitudeRegionnamePropertycount
0Abbotsford68 Studley St2hNaNSSJellis3/09/20162.53067.02.01.01.0126.0NaNNaNYarra City Council-37.8014144.9958Northern Metropolitan4019.0
1Abbotsford85 Turner St2h1480000.0SBiggin3/12/20162.53067.02.01.01.0202.0NaNNaNYarra City Council-37.7996144.9984Northern Metropolitan4019.0
2Abbotsford25 Bloomburg St2h1035000.0SBiggin4/02/20162.53067.02.01.00.0156.079.01900.0Yarra City Council-37.8079144.9934Northern Metropolitan4019.0
3Abbotsford18/659 Victoria St3uNaNVBRounds4/02/20162.53067.03.02.01.00.0NaNNaNYarra City Council-37.8114145.0116Northern Metropolitan4019.0
4Abbotsford5 Charles St3h1465000.0SPBiggin4/03/20172.53067.03.02.00.0134.0150.01900.0Yarra City Council-37.8093144.9944Northern Metropolitan4019.0
5Abbotsford40 Federation La3h850000.0PIBiggin4/03/20172.53067.03.02.01.094.0NaNNaNYarra City Council-37.7969144.9969Northern Metropolitan4019.0
6Abbotsford55a Park St4h1600000.0VBNelson4/06/20162.53067.03.01.02.0120.0142.02014.0Yarra City Council-37.8072144.9941Northern Metropolitan4019.0
7Abbotsford16 Maugie St4hNaNSNNelson6/08/20162.53067.03.02.02.0400.0220.02006.0Yarra City Council-37.7965144.9965Northern Metropolitan4019.0
8Abbotsford53 Turner St2hNaNSBiggin6/08/20162.53067.04.01.02.0201.0NaN1900.0Yarra City Council-37.7995144.9974Northern Metropolitan4019.0
9Abbotsford99 Turner St2hNaNSCollins6/08/20162.53067.03.02.01.0202.0NaN1900.0Yarra City Council-37.7996144.9989Northern Metropolitan4019.0

Last rows

SuburbAddressRoomsTypePriceMethodSellerGDateDistancePostcodeBedroom2BathroomCarLandsizeBuildingAreaYearBuiltCouncilAreaLattitudeLongtitudeRegionnamePropertycount
34847Wollert27 Birchmore Rd3h500000.0PIRay24/02/201825.53750.03.02.02.0383.0118.02016.0Whittlesea City Council-37.61940145.03951Northern Metropolitan2940.0
34848Wollert16 Gunther Wy4h621000.0Shockingstuart24/02/201825.53750.04.02.02.0375.0NaNNaNWhittlesea City Council-37.61331145.03412Northern Metropolitan2940.0
34849Wollert35 Kingscote Wy3h570000.0SPRW24/02/201825.53750.03.02.02.0404.0158.02012.0Whittlesea City Council-37.61031145.03393Northern Metropolitan2940.0
34850Wollert15 Rockgarden Wy3hNaNSPLJ24/02/201825.53750.03.02.02.0268.0135.02016.0Whittlesea City Council-37.61094145.04281Northern Metropolitan2940.0
34851Yarraville78 Bayview Rd3h1101000.0SJas24/02/20186.33013.03.01.0NaN288.0NaNNaNMaribyrnong City Council-37.81095144.88516Western Metropolitan6543.0
34852Yarraville13 Burns St4h1480000.0PIJas24/02/20186.33013.04.01.03.0593.0NaNNaNMaribyrnong City Council-37.81053144.88467Western Metropolitan6543.0
34853Yarraville29A Murray St2h888000.0SPSweeney24/02/20186.33013.02.02.01.098.0104.02018.0Maribyrnong City Council-37.81551144.88826Western Metropolitan6543.0
34854Yarraville147A Severn St2t705000.0SJas24/02/20186.33013.02.01.02.0220.0120.02000.0Maribyrnong City Council-37.82286144.87856Western Metropolitan6543.0
34855Yarraville12/37 Stephen St3h1140000.0SPhockingstuart24/02/20186.33013.0NaNNaNNaNNaNNaNNaNMaribyrnong City CouncilNaNNaNWestern Metropolitan6543.0
34856Yarraville3 Tarrengower St2h1020000.0PIRW24/02/20186.33013.02.01.00.0250.0103.01930.0Maribyrnong City Council-37.81810144.89351Western Metropolitan6543.0